Learning Verbal Transitivity Using LogLinear Models

نویسندگان

  • Nuno Miguel Marques
  • José Gabriel Pereira Lopes
  • Carlos Agra Coelho
چکیده

Portuguese dictionaries lack information about the kind of phrases or clauses a verb goes with. In this paper we describe how this information can be computationally learned from an automatically tagged corpus with almost 10,000,000 words. Loglinear modeling for categorical data will be used to analyze and automatically identify the subcatego-rization dependencies. Loglinear models were also used in an unsuper-vised clustering algorithm to accurately determine the verbal transitive-ness behavior of 196 Portuguese verbs. Evaluation of the obtained results connrmed a precision measure of 96.07% precision and a recall of 68.81% over a standard portuguese dictionary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Loglinear Clustering for Subcategorization Identification

In this paper we will describe a process for mining syntactical verbal subcategorization, i.e. the information about the kind of phrases or clauses a verb goes with. We will use a large text corpus having almost 10,000,000 tagged words as our resource material. Loglinear modeling is used to analyze and automatically identify the subcategorization dependencies. An unsupervised clustering algorit...

متن کامل

Non-Verbal Communication in Models of Communicative Competence and L2 Teachers’ Rating

Non-verbal communication (NVC) plays a major role in various aspects of human life (Andersen, 2004; Cameron, 2001; Johnstone, 2008). Children learning their first language come to realize non-verbal communication as their socialization process takes place (Fletcher & German, 1990; Ingram, 1996; Owens, 2001). However, most EFL learners may have little exposure to these non-verbal aspects of comm...

متن کامل

Item Bias Detection Using Loglinear Irt

A method is proposed for the detection of item bias with respect to observed or unobserved subgroups. The method uses quasi-loglinear models for the incomplete subgroup x test score x Item 1 × • • • x item k contingency table. If subgroup membership is unknown the models are Haberman's incomplete-latent-class models. The (conditional) Rasch model is formulated as a quasi-loglinear model. The pa...

متن کامل

Analysis of Transitivity and Reciprocity in Online Distance Learning Networks

The goal of this research is to explore whether network structures common in social networks also emerge in online distance learning networks. We compare the observed values of reciprocity and transitivity in 95 online distance learning networks and 40 social networks with predictions of Random Graph models. All the networks tested exhibit reciprocities that significantly deviate from the predi...

متن کامل

Item Bias Detection using the Log linear Rasch Model

A method is proposed for the detection of item bias with respect to observed or unobserved subgroups. The method uses quasi-loglinear models for the incomplete subgroup x testscore x item 1 x x itek k contingency table. If subgroup membership is unknown the models are Haberman's incomplete-latent-class models. The (conditional) Rasch model is formulated as a quasi-loglinear model. The parameter...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998